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Abstract 

Background: Aortopathies are a group of disorders characterized by aneurysms, dilation, and tortuosity of the 
aorta. Because of the phenotypic overlap and genetic heterogeneity of diseases featuring aortopathy, molecular 
testing is often required for timely and correct diagnosis of affected individuals. In this setting next generation 
sequencing (NGS) offers several advantages over traditional molecular techniques. 

Methods: The purpose of our study was to compare NGS enrichment methods for a clinical assay targeting 
the nine genes known to be associated with aortopathy. RainDance emulsion PCR and SureSelect RNA-bait 
hybridization capture enrichment methods were directly compared by enriching DNA from eight samples. 
Enriched samples were barcoded, pooled, and sequenced on the lllumina HiSeq2000 platform. Depth of coverage, 
consistency of coverage across samples, and the overlap of variants identified were assessed. This data was also 
compared to whole-exome sequencing data from ten individuals. 

Results: Read depth was greater and less variable among samples that had been enriched using the RNA-bait 
hybridization capture enrichment method. In addition, samples enriched by hybridization capture had fewer 
exons with mean coverage less than 10, reducing the need for followup Sanger sequencing. Variants sets 
produced were 77% concordant, with both techniques yielding similar numbers of discordant variants. 

Conclusions: When comparing the design flexibility, performance, and cost of the targeted enrichment methods 
to whole-exome sequencing, the RNA-bait hybridization capture enrichment gene panel offers the better 
solution for interrogating the aortopathy genes in a clinical laboratory setting. 

Keywords: Aortopathy, Hybridization capture, Marfan syndrome, Next generation sequencing (NGS), Target 
enrichment, Emulsion PCR 



Background 

Aortopathies are a group of disorders characterized by 
aneurysms, dilation, and tortuosity of the aorta. Thoracic 
aortic aneurysm with dissection is the most common 
fatal condition involving the aorta [1], and can be 
syndromic, familial nonsyndromic, or sporadic. Over 
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20% of thoracic aneurysms result from inherited disor- 
ders [2], Syndromic connective tissue diseases with 
aortic involvment, such as Marfan syndrome (MFS; 
OMIM# 154700), Loeys-Dietz syndrome (LDS; OMIM# 
609192), Ehlers Danlos syndrome type IV (EDS IV; 
OMIM# 130050), and congenital contractural arachno- 
dactyly (OMIM# 121050) each result from mutations 
in different genes yet have broadly overlapping pheno- 
types [3-6]. In addition, mutations in genes related to 
the structure and function of the aortic wall, including 
MYH11 [7,8], ACTA2 [9], SLC2A10 [10], and NOTCH1 
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[11], have been linked to non-syndromic familial forms 
of thoracic aortic aneurysms (reviewed in [12]). 

Due to the overlapping phenotypes presented by these 
disorders genetic sequencing is often required for accurate 
diagnosis and appropriate clinical intervention. However, 
comprehensive sequencing of the many genes involved 
is often impractical with traditional Sanger methods. In 
contrast, high-throughput "next-generation" sequencing 
(NGS) has emerged as a new tool in the clinical laboratory 
for the rapid, cost-effective detection of mutations in genes 
associated with multigenic disorders [13,14]. NGS assays 
targeted to a panel of genomic regions associated with 
known pathogenic mutations offer several advantages over 
traditional sequencing methods, including lower cost and 
rapid assessment of many regions. 

While NGS offers a promising alternative to Sanger 
sequencing, the clinical utility of NGS methods depends 
on the ability to accurately isolate or amplify the genomic 
regions of interest. Ideally, the enrichment will yield high 
read depths in the targeted regions while keeping errors 
introduced through PCR and other sample manipula- 
tions to a minimum. Several NGS target enrichment 
strategies are currently available, all with various advan- 
tages depending on the size of the targeted region and 
the genetic targets themselves. Previous comparisons of 
enrichment techniques have identified considerable differ- 
ences in the depth of coverage of targeted regions as well 
as the number of variants identified [15-17]. 

In this work we compare two competing enrichment 
protocols, an emulsion-PCR based technique from 
RainDance technologies [18] and a method utilizing 
solution-based RNA-bait hybrid capture marketed as the 
SureSelect Target Enrichment Platform [19]. Using an 
aortopathy panel of nine genes (Table 1), we assess the 
clinical utility of each enrichment technique. We quan- 
tify the coverage of the targeted regions and compare 
the accuracy and overlap of the variants identified. To 
determine which molecular approach is best suited for 
routine clinical diagnostics for this disorder, we also 
compare read depths for the aortopathy genes generated 
from the two custom-designed gene panel enrichments 
to those obtained from whole-exome sequencing. 

Methods 

Samples 

Eight samples were used to directly compare the enrich- 
ment technologies. Four samples were positive controls 
with a known mutation in one of the aortopathy genes 
(Table 2, Samples 1-4). Samples 5 and 6 were from 
patients with a clinical diagnosis of aortopathy and a 
negative molecular analysis of FBN1, TGFftRl, and 
TGFfiR2 by Sanger sequencing analysis and multiplex 
ligation-dependent probe amplification (MLPA) [20]. A 
normal, healthy individuals DNA in which a mutation in 



the aortopathy genes was not expected was used as a 
negative control. A Coriell sample with an FBN1 
c.l888delAAinsC, p.M717X genotype was also evaluated. 

This study was approved by the University of Utah 
and Primary Children's Medical Center Institutional 
Review Boards (IRB#00028740). Written informed con- 
sent for participation in the study was obtained from the 
participants or their parents. 

Emulsion PCR enrichment 

A primer library (RainDance Technologies, Boston, MA) 
was custom-designed to amplify 194 exons and exon/ 
intron boundaries for nine aortopathy genes listed in 
Table 1 (-0.1 Mb). Primers were designed using Rain- 
Dances design parameters and Primer3 (http://primer3. 
sourceforge.net/). The 350 PCR amplicons ranged in size 
from 201-919 bp and had a guanine cytosine (GC) con- 
tent of 24-80%. Genomic DNA (1.5-3 ug) was sheared 
to 2-4 kb fragments using a Covaris S2 instrument 
(Covaris, Woburn, MA) and added to a mixture that 
included all the components of the PCR reaction exclud- 
ing the primers. This mixture and the primer library 
were loaded separately onto the RDT1000 instrument, 
and PCR droplets containing one primer pair per droplet 
were generated. After amplification, emulsion PCR dro- 
plets were broken releasing the amplicons which were 
then purified and concatenated according to the manu- 
facturers instructions. Concatenated samples were 
sheared to 300 bp fragments, and Illumina adapters were 
added using the SPRI-TE instrument (Beckman-Coulter, 
Danvers, MA). Barcode indexes were added using PCR, 
and sample quality and quantity was assessed using a 
Bioanalyzer (Agilent Technologies, Santa Clara, CA). 

Solution-based hybridization capture enrichment 

Custom RNA baits were designed to specifically target 
the exons and exon/intron boundaries of the nine aorto- 
pathy genes from Table 1 (-0.1 Mb). RNA baits were 
tiled at 5 x spacing and were in replicates of 10 to in- 
crease hybridization efficiency. Genomic DNA (3 ug) 
was sheared to 180 bp fragments. Illumina adapters were 
added using SureSelect XT kit reagents (Agilent Tech- 
nologies, Santa Clara, CA). Adapter ligated DNA under- 
went hybridization with the biotinylated RNA baits for 
24 hours at 65°C. Hybridized DNA targets of interest 
were captured using streptavidin-coated magnetic beads. 
DNA targets of interest were eluted and barcode/ 
indexed after a series of washes to remove the non- 
targeted, unbound genome. DNA quality and quantity 
was assessed (Bioanalyzer). 

NGS sequencing and data analysis 

Concentrations of indexed samples from the enrich- 
ments were determined by quantitative- PCR (KAPA 
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Table 1 Aortopathy panel genes 



Gene name 


Chromosomal 
locus 


Protein 


Disease 


No. reported 
mutations 


Transcript 
size 


Exons 


FBN1 


15q21.1 


Fibrillin 


Marfan syndrome 


601 


8616 bp 


66 


TGFBR1 


9q33-q34 


Transforming growth factor, 
beta receptor 1 


Loeys-Dietz syndrome; 
Marfan-like syndrome 


28° 


1512 bp 


9 


TGFBR2 


3p22 


Transforming growth factor, 
beta receptor 2 


Loeys-Dietz syndrome; 
Marfan-like syndrome 


105° 


1779 bp 


7 
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MYH11 


16p13.13-p13.12 


Myosin heavy chain 1 1 


TAAD b -patent ductus 
arteriosus 


3 


5919 bp 


41 


ACTA2 


10q22-q24 


smooth muscle actin, alpha 2 


TAAD4 syndrome 


19 


1134 bp 


9 


SLC2A10 


20q13.1 


solute carrier family 2 (facilitated 
glucose transporter), member 10 


Arterial Tortuosity 
syndrome 


18 


1626 bp 


5 


N0TCH1 


9q34.3 


Notch homolog 1, 
translocation-associated 


BAV-TAAD syndrome 


11 


7668 bp 


34 


FBN2 


5q23-q31 


Fibrillin 2 


BAV-TAAD syndrome 


45 


8739 bp 


71 



a Of the 28 patients with a TGFBR1 mutation, 14 had Loeys-Dietz syndrome, 4 had thoracic aortic aneurysms, and 4 had Marfan-like syndrome. Of the 105 TGFBR2 
mutations, 32 had Loeys-Dietz syndrome, 31 had Marfan-like syndrome, and 8 had thoracic aortic aneurysms. 
^Thoracic aortic aneurysms and dissections (TAAD); Bicuspid-Aortic Valve (BAV). 



Biosystems, Woburn, MA). Samples were pooled and 
sequenced on a HiSeq2000 instrument (Illumina, San 
Diego, CA) using 2 x 100 paired-end sequencing. 
Sequences were aligned to the human genome reference 
(hgl9) sequence using the Burrows-Wheeler Alignment 
tool (BWA 0.5.9) [21] with default parameters. PCR dupli- 
cates were removed using the Samtools package [22], and 
base quality score recalibration, local realignment, and 
variant calling were performed with the Genome Analysis 
Toolkit (GaTK vl.3) [23]. Variants with coverage less than 
6, with 'quality-by-depth' scores of less than two, with 
variant allele frequencies of less than 0.15, or with overall 
quality scores of less than 10 were discarded. 

Sanger sequencing 

Select variants were Sanger sequenced. Primers 
sequences are available upon request. Amplicons were 
bi-directly sequenced using the Big Dye® Terminator 
v3.1 cycle sequencing kit and an ABI 3730 DNA 
Analyzer (Life Technologies, Carlsbad, CA). Sequences 
were compared to reference sequences using Mutation 
Surveyor (SoftGenetics, State College, PA). 

Exome sequencing 

In addition to the custom targeted enrichments, we also 
examined depth of coverage for the targeted regions 
obtained through exome sequencing. Briefly, ten samples 
from individuals without known aortopathies were 
sequenced on the Illumina HiSeq2000 platform after 
undergoing exome capture enrichment (SureSelect 
50Mb exome kit, Agilent Technologies, Santa Clara, 
CA). Exome samples were indexed and pooled, two per 
lane, using a one-to-one ratio, and were sequenced using 
2xl00-base pair paired end reads. 



Results 

Sample enrichment 

Six of eight samples underwent successful enrichment 
using both RainDance emulsion PCR and SureSelect 
hybridization capture (Table 2). Sample 7 (from Coriell) 
failed during the RDT1000 merge when the DNA dro- 
plets failed to merge properly with the primer library 
droplets. Hybridization capture of this sample was also 
poor and analysis was not completed due to sub-optimal 
DNA quality (A260/A280 was n °t between 1.8 and 2.0). 
Sample 8 failed during hybridization capture due to a 
pipetting error. Herein, the accuracy of the two enrich- 
ment methods, depth of coverage, and variant calling 
were compared among the six samples enriched using 
both techniques. 

Accuracy 

Pathogenic mutations from four positive control samples 
were detected using both enrichment methods (Table 2, 
samples 1-4). For example, a heterozygous FBN1 
C.1585C > T, p.R529X mutation was detected in sample 1 
using both enrichment methods and was confirmed 
using Sanger sequencing (Additional file 1: Figure SI). 

Depth of coverage 

Omitting failed samples, both methods generated very 
high coverage in the targeted regions. Pooling data from 
all samples and examining coverage on a gene-by-gene 
basis, the SureSelect enriched material yielded slightly 
higher, but also more variable, coverage (per-gene aver- 
age 471.7, range 189.9-592.0) compared to the emulsion 
PCR technique (per-gene average 414.8, range 303.4- 
554.4). Both methods yielded significantly higher aver- 
age coverage per gene than whole- exome sequencing 



Table 2 NGS accuracy sample results 



# 


Sample 


Mutations 


Coverage RD° 
(% mutant allele) 


Coverage SS° 
(% mutant allele) 


Mean coverage 


Other information 


1 


FBN1 Positive 


C.15850T, p.R529X 


101x (47%) 


576x (60%) 


1477x (RD) 508x (SS) 


In COL3A1, c.198A>G, p.l66M, 
a novel change. 


2 


TGFBR1 Positive 


c.799A>G, p.N267D 


31 7x (51%) 


504x (58%) 


462x (RD) 563x (SS) 




3 


TGFBR2 Positive 


c.1583G>A, p.R528H 


183x (48%) 


257x (50%) 


1 77x (RD) 499x (SS) 




4 


MY7-/7 7 Positive 


IVS32+1C>A (splice) 


71 x (58%) 


80x (50%) 


140x (RD) 164x (SS) 




5 


Symptomatic unknown 


No mutations found. 


NA 


NA 


33x (RD) 260x (SS) 


In SLC2A10 p.A206T (rs#2235491) 

and p.A385G (rs#79849424). 

In NOTCH1, IVS16-4C>CT (rs#31 25001 ). 


6 


Symptomatic unknown 


COL3A1 IVS9-7T>C novel change; 
potential splice site variant 


48x (56%) 


41 6x (50%) 


86x (RD) 355x (SS) 


In COL3A1, p.A698T (rs#1 800255). 
In FBN2, p.V965l (rs#1 54001). 


7 


FBN1 Coriell positive 


Unable to confirm due to 
sample quality. 


Failed merge between 
DNA and primer library 
droplets 


Poor data due to 
low quality DNA 


NA 




8 


Healthy Control 


No mutations found 


NA 


Failed due pipetting error 


340x (RD) 





a RainDance (RD) and SureSelect (SS). All variants reported in this table are heterozygous as confirmed by Sanger sequencing (data not shown). NA, not applicable. 
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(Figure 1A) and had substantially fewer exons with less 
than tenfold coverage (Figure IB). SureSelect results 
were influenced by particularly low coverage for the 
NOTCH 1 gene (mean 189x), a feature not observed with 
the sequences generated by emulsion PCR. Low cover- 
age may be related to the GC content of the NOTCH 1 
gene, in which 17 of 34 exons are > 65% GC. 

Examining exons individuals, the SureSelect enrich- 
ment yielded fewer total exons with coverage below 10 
when compared to both RainDance and the whole 
exome capture (Figure IB). Some 98.5% of all exons had 
mean depth greater than 50 for SureSelect, as compared 
to 85.5% for RainDance and 70% for whole exome. 

Despite relatively high read depths across most exons, 
several exons consistently had little or no coverage 
among the samples. Within the SureSelect-enchriched 
sequences, much of the variation appears to be related to 



the GC content of the surrounding sequence (Figure 2). 
For exons containing greater than 75% GC (for instance, 
exon 1 in the NOTCH 1 and TGFfiRl genes) no coverage 
was observed. 

In contrast to the strong correlation of GC content 
with read depth for the hybridization capture enrich- 
ment, exon coverage varied in an unpredictable manner 
for the emulsion PCR enrichment method (Figure 3). To 
assess the degree of coverage variability across samples, 
we computed the standard deviation in coverage for 
each exon across samples for both custom-designed 
panel enrichment techniques. The mean of these devia- 
tions across exons reflects the overall degree of coverage 
variability across all samples, with zero indicating that 
each exon was covered by the same number of reads in 
every sample. Between sample variability was signifi- 
cantly higher for the emulsion PCR enrichment, with a 
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□ RainDance 
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Coverage < 10 Coverage 10-50 Coverage > 50 

Figure 1 Mean depth of coverage per aortopathies gene for the enrichment methods. In A, both custom-designed gene panels 
(SureSelect and RainDance) yielded significantly higher average coverage per gene compared to whole-exome sequencing (SureSelect 50 Mb All 
Exon Capture). Bars indicate one standard deviation. In B, the fraction of exons covered to the given depth for the enrichment methods is 
shown. The custom-designed SureSelect gene panel yielded greater average read depths across the 294 exons for all samples versus the 
RainDance or whole-exome enrichments. 
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Reference GC Content 



■ SureSelect 
♦ RainDance 



Figure 2 Mean exon read depth as a function of reference sequence GC content for the custom-designed gene panels. High GC content 
resulted in decreased exon coverage in the SureSelect enriched samples. GC content had little to no effect on exon coverage of the RainDance 
enriched samples, which were more prone to sporadic amplicon failure. 



mean of 330.4, compared to 215.8 for the hybridization 
enrichment panel (paired £-test, p < 2.2 x 10~ 16 , Figure 4). 

Variants analysis 

In addition to the number of reads mapping to targeted 
locations, we also assessed the quantity and character of 
the variants identified by both enrichment methods. Be- 
cause both methods targeted overlapping, but partially 
distinct regions, we considered only those variants in the 
intersection of the two capture regions. The intersection 
of the two capture regions was 60.58 Kb in size, com- 
prising 64% of the bases targeted by SureSelect and 99% 
of bases in the RainDance capture. 

In total, 252 unique variants were identified across all 
samples, 195 of which were identified by both methods 



(Additional file 1: Table SI). The sequences generated 
by the RainDance enrichment contained several more 
variants, 221 in total, including 26 not identified in 
the SureSelect-enriched sequences. In contrast, the 
SureSelect protocol yielded 214 total variants, of which 
19 were not found using the emulsion PCR tech- 
nique. Among the variants common to both methods, 
55 (28.2%) were in dbSNP (v. 132), while 2 of the 17 var- 
iants (12%) unique to SureSelect were in dbSNP, and 
3 of the 23 variants (13%) unique to emulsion PCR were 
in dbSNP. Assuming variants found in dbSNP are not 
false positives in our samples, this suggests that even the 
relatively high coverage obtained by both enrichment 
methods did not guarantee accurate variant detection, as 
both the custom targeted enrichment methods identified 




1 2 3 4 S 6 1 2 3 4 5 6 12 34S6789 

Emulsion PCR Hybridization capture Whole exome sequencing 

Figure 3 Number of exons with mean read depth less than 20, across samples (x-axis) and genes (y-axis). The color of the block 
represents the number of exons with low coverage for a given sample and gene. 
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SureSelect RainDance 

Figure 4 Variability in coverage: RainDance versus SureSelect. 

Standard deviation in coverage for each exon across enrichment 
samples was computed. The mean deviation across exons reflects 
the overall degree of coverage variability across samples 1-6, with 
zero indicating that each exon was covered by the same number of 
reads in every sample. Between sample variability was significantly 

higher for the RainDance enrichment. 

\ J 



several variants not recognized by the other method. 
Similarly discordant results have been found in other 
recent studies [13]. 

Examination of the discordant variants suggested sev- 
eral sources for the disagreement. First, discordant var- 
iants appeared in the intronic and untranslated areas at 
the periphery of individual capture regions. Ten SureSe- 
lect and fifteen RainDance discordant variants were 
located in non-exonic regions. Examination of read 
depth in these areas suggested that coverage was rela- 
tively low compared to the mean for the whole exon, im- 
plicating variation in read depth as a potential source of 
discordant or false positive calls. 

A second source of discordant variation was the 
NOTCH 1 gene. A relatively high fraction of the discord- 
ant variants appeared in the exons of NOTCH 1 (15 of 
45, or 33%), and these appeared with similar frequencies 
in both the RainDance and SureSelect samples (8 from 
RainDance, 7 from SureSelect). NOTCH 1 is likely to be 
problematic for two reasons. First, the high GC content 
of NOT CHI reduces the read depths obtained with the 
SureSelect enrichment procedure. Second, NOTCH 1 
bears substantial sequence homology to other NOTCH 
genes, potentially confounding alignment algorithms and 
generating spurious variant calls. 

Clinical utility of the aortopathy panel 

Two clinically affected individuals who had previously 
tested negative for a FBN1, TGFBR1, and TGFBR2 



mutation by Sanger sequencing and MLPA [20] were 
evaluated using the aortopathy panel. Sample 5 was from 
a 19-year-old female with borderline aortic dilatation, 
mild tricuspid valve prolapse and mild mitral valve insuf- 
ficiency, height of 178 cm (5' 10") (99 th percentile), arm 
span of 176 cm (5'9") severe pectus carinatum, scoliosis, 
arachnodactyly, joint mobility, positive wrist and thumb 
signs, and skin striae. Analysis of the coding region 
variants and splice site variants from both enrichment 
methods revealed two nonsynonymous heterozygous 
variants in SLC2A10 (c.616G > A, p.Ala206Thr and 
c.H54C>G, p. Ala385Gly) and a NOTCH1 potential 
splice site variant (IVS16-4C > T). These variants were 
present in dbSNP and are likely not disease causing. 

Sample 6 was from a 34-year-old female with mild aortic 
dilatation in the setting of past aortic dissection and un- 
known family history. Two heterozygous nonsynonymous 
variants, one in COL3A1 (c.2092G > A, p.Ala698Thr) and 
the other in FBN2 (c.2893G > A, p.Val965Ile) were both 
present in dbSNP, rsl800255 and rsl54001 at 19.8% 
and 72.4% frequencies in 1000 genomes (November 2010 
release, 1000genomes.org), respectively. A novel heterozy- 
gous COL3A1 variant (IVS9-7T > C) not found in dbSNP 
or the COL3A1 locus specific mutation database was also 
identified. Splice site prediction programs were run to 
determine if the change altered splicing [24], but they pre- 
dicted that no change in splicing would occur. It is likely 
that this variant is also not disease causing. 

No synonymous splice site variants were identified in 
either patient. It is possible that these two patients have 
a large deletion or duplication of one of the nine aorto- 
pathy genes, but this has yet to be evaluated. Currently, 
detection of structural variation cannot be reliably 
detected from NGS data alone. Patient samples that test 
negative from the NGS assay should be run on an exonic 
level comparative genomic hybridization (CGH) array 
assay used to evaluate large copy number changes in the 
same gene set. Until NGS data analysis programs im- 
prove in their ability and accuracy to detect structural 
variation, CGH array assays are the suggested approach 
for detecting such large deletions or duplications in multi- 
gene panels. Alternatively, another gene not included in 
the aortopathy panel may be responsible for the disorder 
in these patients. 

Discussion 

Enrichment performance 

All targeted enrichment methods evaluated yielded very 
high read depths in the aortopathies targeted regions, 
with all genes except for NOTCH1 from the exome 
capture obtaining a mean depth of coverage of over 50 x 
(Figure 1). Overall, the custom-designed hybridization 
capture enrichment yielded lower coverage in regions 
of high GC content (Figure 2), but consistently high 
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coverage in regions of low GC bias. The emulsion PCR 
method, in contrast, yielded exons with very low cover- 
age that had no obvious correlate (Figure 2). Although 
influenced by GC content, the hybridization capture 
nonetheless yielded fewer total exons with low coverage 
than did the emulsion PCR technique (Figure IB). Over- 
all, emulsion PCR yielded significantly higher between- 
sample variability in coverage. In a clinical setting, a high 
degree of between sample consistency is desirable. Low 
coverage exons will likely require follow-up Sanger 
sequencing. Knowing which exons will require such 
treatment beforehand may ameliorate the cost and add- 
itional time resulting from designing new primer sets on 
an as-needed basis. 

Despite satisfactory coverage, the methods yielded only 
partially overlapping sets of variants, with some 10% of 
variants identified by one method but not by the other 
(Additional file 1: Table SI). These inconsistencies may 
be related to variability in read depth as well as align- 
ment ambiguity related to sequence homology. Similarly 
high discordancy rates were found in a recent study 
comparing the RainDance and Fluidigm enrichment 
platforms [13], in which only 42% of variants identified 
were common to both methods, and 67% of the discord- 
ant variants were found in regions of less than 20-fold 
coverage. Until procedures are devised that substantially 
decrease the number of regions with low coverage, 
Sanger confirmation of suspected variants will likely be 
necessary prior to clinical action. 

A high sample failure rate of 25% (2 of 8 samples) 
was observed due to sample quality (sample 7, Rain- 
Dance and SureSelect) and a pipetting error (sample 8, 
SureSelect) (Table 2). Based on our results, only DNA 
samples of high quality (A 2 6o/280 is between 1.8 and 2.0) 
should be evaluated using NGS enrichment. If a sample 
falls below this initial quality measurement, DNA should 
be extracted again and/ or the sample should be purified 
until an acceptable quality is obtained. For complex 
enrichment protocols such as the SureSelect method, 
pipetting errors can be avoided by incorporating auto- 
mation into sample preparation (discussed further 
below). This is especially important as the number of 
clinical samples increase. 

NGS enrichment in the clinical laboratory 

When compared to emulsion PCR, the solution-based 
hybridization technique yielded higher coverage as well 
as greater predictability in performance. These attri- 
butes, when combined with lower cost, strongly favor 
the hybridization capture for enrichment of the aortopa- 
thy panel in the clinical setting. Emulsion PCR enrich- 
ment is significantly more expensive and requires the 
use of a costly instrument. Exome sequencing is also 
more costly because the cost to sequence the exome at a 



high enough depth of coverage to accurately call variants 
is much higher than for the targeted panel. Samples can 
also be indexed and pooled prior to hybridization capture 
[25], reducing the overall cost per sample even further. 

Despite the lower cost, the hybridization capture 
protocol is more time consuming, complex, and labor 
intensive when compared to the emulsion PCR tech- 
nique. However, the ability to automate this process in 
the clinical laboratory alleviates these challenges and 
yields more consistent, reliable results [26]. Currently, 
the emulsion PCR instrument processes one sample/hour, 
limiting the total number of samples processed in a day 
to approximately twenty-four. With the increasing inter- 
est in exome sequencing, automation of hybridization 
capture enrichment protocols have become readily avail- 
able [26], and up to 96 different custom-designed NGS 
panels and exome hybridizations can take place at the 
same time during the same run. Solution-based hybridi- 
zation libraries can be ordered in larger volumes, ali- 
quoted for up to two-three freeze/thaw cycles, and stored 
for up to one year. RainDance primer libraries have to be 
switched out after the eighth sample with the RDT1000 
instrument design, and if there are not eight samples to 
run using the same primer library, the rest will be wasted. 

Conclusions 

The aortopathy panel offers a cost-effective, faster mo- 
lecular diagnostic assay compared to the conventional 
gene-by-gene Sanger sequencing approach. Both Rain- 
Dance and SureSelect enrichment methods accurately 
identified variants in the positive samples assayed, and 
overall coverage was high across most exons for all sam- 
ples. Hybrid capture results demonstrated that genes with 
high homology to other genes and high GC rich regions 
can be problematic. However, we note that increases in 
read length, currently available with the Ion Torrent or 
Alumina MiSeq instruments, are likely to alleviate 
homology-related alignment issues. After comparing the 
cost, design flexibility, and versatility of the workflows of 
the two enrichment methods, the custom-designed 
hybridization capture design offers the best solution for 
enriching the aortopathy genes in a clinical laboratory set- 
ting. Patients with Marfan syndrome and Marfan-like syn- 
dromes featuring aortopathies will benefit from the timely 
molecular diagnosis of this new testing approach which 
will lead to the appropriate surveillance and interventions 
aimed at preventing the significant morbidity and mortal- 
ity associated with these conditions. 

Additional file 



Additional file 1: Figure SI. NGS and Sanger sequencing confirm the 
pathogenic FBN1 mutation in sample 1. A heterozygous nonsense 
mutation (c.1585C>T, p.R529X) was detected using RainDance PCR 
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enrichment (panel A) and SureSelect capture enrichment (panel B). In C, 
Sanger sequencing confirmed the mutation detected in both enrichment 
strategies. The FBN1 gene is on the reverse strand, and appears in the 3' 
to 5' orientation in the NGS traces versus the Sanger sequencing trace 
which is 5' to 3'. Table SI. Summary of variants detected from the 
RainDance and SureSelect enrichments. 



Competing interests 

We have no financial, nonfinancial, or competing interests to declare. 
Authors' contributions 

WLWD participated in the design of the study, performed all experiments, 
assisted in data analysis, and wrote the manuscript; BO analyzed the data 
and wrote the manuscript; LVF participated in the design of the study and 
wrote the manuscript; JDD and PGR assisted in data analysis; PP assisted 
with experiments and data analysis; AFR and AT/ were responsible for 
diagnosis and management of patients and participated in the design of the 
study; PBT conceived the study, and participated in its design and 
coordination. All authors read and approved the final manuscript. 

Acknowledgments 

We thank Lin Pham, Ph.D. for her assistance in the design of the RainDance 
library. We thank Owen Hardy, Maria Celeste-Ramirez, and Christopher 
Hopkins, Ph.D. for providing assistance in the design of the SureSelect library. 
This work was funded by the ARUP Institute for Clinical and Experimental 
Pathology. 

Author details 

1 ARUP Institute for Clinical and Experimental Pathology, Salt Lake City, USA. 
department of Pathology, University of Utah, Salt Lake City, USA. 
department of Pediatrics, Division of Medical Genetics, University of Utah, 
Salt Lake City, USA. department of Pediatrics, Division of Cardiology, 
University of Utah, Salt Lake City, USA. 5 Molecular Genetics Department, 
ARUP Institute for Clinical and Experimental Pathology, 500 Chipeta Way, Salt 
Lake City, UT 84108, USA. 

Received: 18 July 2012 Accepted: 9 October 2012 
Published: 14 November 2012 

References 

1 . Klein DG: Thoracic aortic aneurysms. J Cardiovasc Nurs 2005, 4:245-250. 

2. Verloes A, Sakalihasan N, Koulischer L, Limet R: Aneurysms of the 
abdominal aorta: familial and genetic aspects in three hundred thirteen 
pedigrees. J Vase Surg 1995, 21:646-655. 

3. Law C, Bunyan D, Castle B, Day L, Simpson I, Westwood G, Keeton B: 
Heterozygous TGFBR2 mutations in Marfan syndrome. Nat Genet 2004, 
36:855-860. 

4. Loeys BL, Chen J, Neptune ER, Judge DP, Podowski M, Holm T, Meyers J, 
Leitch CC, Katsanis N, Sharifi N, Xu FL, Myers LA, Spevak PJ, Cameron DE, 
De Backer J, Hellemans J, Chen Y, Davis EC, Webb CL, Kress W, Coucke P, 
Rifkin DB, De Paepe AM, Dietz HC: A syndrome of altered cardiovascular, 
craniofacial, neurocognitive and skeletal development caused by 
mutations in TGFBR1 or TGFBR2. Nat Genet 2005, 37:275-281. 

5. Pannu H, Fadulu VT, Chang J, Lafont A, Hasham SN, Sparks E, Giampietro PF, 
Zaleski C, Estrera AL, Safi HJ, Shete S, Willing MC, Raman CS, Milewicz DM: 
Mutations in transforming growth factor-(3 receptor type II cause familial 
thoracic aortic aneurysms and dissections. Circulation 2005, 112:513-520. 

6. Gupta PA, Wallis DD, Chin TO, Northrup H, Tran-Fadulu VT, Towbin JA, 
Milewicz DM: FBN2 mutation associated with manifestations of Marfan 
syndrome and congenital contractural arachnodactyly. J Med Genet 2004, 
41:e56. 

7. Zhu L, Vranckx R, Khau Van Kien P, Lalande A, Boisset N, Mathieu F, 
Wegman M, Glancy L, Gasc JM, Brunotte F, Bruneval P, Wolf JE, Michel JB, 
Jeunemaitre X: Mutations in myosin heavy chain 11 cause a syndrome 
associating thoracic aortic aneurysm/aortic dissection and patent ductus 
arteriosus. Nat Genet 2006, 38:343-39. 

8. Khau Van Kien P, Mathieu F, Zhu L, Lalande A, Betard C, Lathrop M, 
Brunotte F, Wolf JE, Jeunemaitre X: Mapping of familial thoracic aortic 
aneurysm/dissection with patent ductus arteriosus to 1 6p1 2.2-pl 3.1 3. 
Circulation 2005, 112:200-206. 



9. Guo DC, Pannu H, Tran-Fadulu V, Papke CL, Yu RK, Avidan N, Bourgeois S, 
Estrera AL, Safi HJ, Sparks E, Amor D, Ades L, McConnell V, Willoughby CE, 
Abuelo D, Willing M, Lewis RA, Kim DH, Scherer S, Tung PP, Ahn C, Buja LM, 
Raman CS, Shete SS, Milewicz DM: Mutations in smooth muscle a-actin 
(ACTA2) lead to thoracic aortic aneurysms and dissections. Nat Genet 
2007, 39:1488-1493. 

10. Coucke PJ, Willaert A, Wessels MW, Callewaert B, Zoppi N, De Backer J, 
Fox JE, Mancini GM, Kambouris M, Gardella R, Facchetti F, Willems PJ, 
Forsyth R, Dietz HC, Barlati S, Colombi M, Loeys B, De Paepe A: Mutations 
in the facilitative glucose transporter GLUT10 alter angiogenesis and 
cause arterial tortuosity syndrome. Nat Genet 2006, 38:452-457. 

1 1 . Garg V, Muth AN, Ransom JF, Schluterman MK, Barnes R, King IN, Grossfeld 
PD, Srivastava D: Mutations in NOTCH 1 cause aortic valve disease. 
Nature 2005, 437:270-274. 

12. El-Hamamsy I, Yacoub MH: Cellular and molecular mechanisms of 
thoracic aortic aneurysms. Nat Rev Cardiol 2009, 6:771-786. 

13. Jones MA, Bhide S, Chin E, Ng BG, Rhodenizer D, Zhang VW, Sun JJ, 
Tanner A, Freeze HH, Hegde MR: Targeted polymerase chain 
reaction-based enrichment and next generation sequencing for 
diagnostic testing of congenital disorders of glycosylation. Genet Med 
2011, 13:921-932. 

14. Berg JS, Evans JP, Leigh MW, Omran H, Bizon C, Mane K, Knowles MR, Week 
KE, Zariwala MA: Next generation massively parallel sequencing of 
targeted exomes to identify genetic mutations in primary ciliary 
dyskinesia: implications for application to clinical testing. Genet Med 
2011, 13:218-229. 

15. Teer JK, Bonnycastle LL, Chines PS, Hansen NF, Aoyama N, Swift AJ, 
Abaan HO, Albert TJ, Comparative Sequencing Program NISC, Margulies EH, 
Green ED, Collins FS, Mullikin JC, Biesecker LG: Systematic comparison of 
three genomic enrichment methods for massively parallel DNA 
sequencing. Genome Res 2010, 20:1420-1431. 

16. Kiialainen A, Karlberg O, Ahlford A, Sigurdsson S, Lindblad-Toh K, 
Syvanen AC: Performance of microarray and liquid based capture 
methods for target enrichment for massively parallel sequencing and 
SNP discovery. PLoS One 201 1, 6:e 16486. 

17. Hedges DJ, Guettouche T, Yang S, Bademci G, Diaz A, Andersen A, 
Hulme WF, Linker S, Mehta A, Edwards YJ, Beecham GW, Martin ER, 
Pericak-Vance MA, Zuchner S, Vance JM, Gilbert JR: Comparison of three 
targeted enrichment strategies on the SOLiD sequencing platform. 
PLoS One 201 1, 6:e1 8595. 

18. Tewhey R, Warner JB, Nakano M, Libby B, Medkova M, David PH, 
Kotsopoulos SK, Samuels ML, Hutchison JB, Larson JW, Topol EJ, Weiner MP, 
Harismendy O, Olson J, Link DR, Frazer KA: Microdroplet-based PCR 
enrichment for large-scale targeted sequencing. Nat Biotechnol 2009, 
27:1025-1031. 

19. Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, 
Fennell T, Giannoukos G, Fisher S, Russ C, Gabriel S, Jaffe DB, Lander ES, 
Nusbaum C: Solution hybrid selection with ultra-long oligonucleotides 
for massively parallel targeted sequencing. Nat Biotechnol 2009, 
27:182-189. 

20. Furtado LV, Wooderchak-Donahue W, Rope AF, Yetman AT, Lewis T, Plant P, 
Bayrak-Toydemir P: Characterization of large genomic deletions in the 
FBN1 gene using multiplex ligation-dependent probe amplification. 
BMC Med Genet 2011, 12:119. 

21. Li H, Durbin R: Fast and accurate short read alignment with 
Burrows-Wheeler transform. Bioinformatics 2009, 25:1754-1760. 

22. Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and 
calling variants using mapping quality scores. Genome Res 2008, 
18:1851-1858. 

23. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, 
Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA: The Genome 
Analysis Toolkit: a MapReduce framework for analyzing next-generation 
DNA sequencing data. Genome Res 2010, 20:1297-1303. 

24. Cartegni L, Wang J, Zhu Z, Zhang MQ, Krainer AR: ESEfinder: a web 
resource to identify exonic splicing enhancers. Nucelic Acids Res 2003, 
31:3568-3571. 

25. Bell CJ, Dinwiddie DL, Miller NA, Hateley SL, Ganusova EE, Mudge J, 
Langley RJ, Zhang L, Lee CC, Schilkey FD, Sheth V, Woodward JE, 
Peckham HE, Schroth GP, Kim RW, Kingsmore SF: Carrier testing for severe 
childhood recessive diseases by next generation sequencing. Sci Transl 
Med 2011, 3:65 ra4. 



Wooderchak-Donahue et at. BMC Medical Genomics 2012, 5:50 
http://www.biomedcentral.eom/1755-8794/5/50 



Page 10 of 10 



26. Fisher S, Barry A, Abreu J, Minie B, Nolan J, Delorey TM, Young G, Fennell TJ, 
Allen A, Ambrogio L, Berlin AM, Blumenstiel B, Cibulskis K, Friedrich D, 
Johnson R, Juhn F, Reilly B, Shammas R, Stalker J, Sykes SM, Thompson J, 
Walsh J, Zimmer A, Zwirko Z, Gabriel S, Nicol R, Nusbaum C: A scalable, 
fully automated process for construction of sequence-ready human 
exome targeted capture libraries. Genome Biol 201 1, 12:R1. 



doi:1 0.1 186/1755-8794-5-50 

Cite this article as: Wooderchak-Donahue et al:. A direct comparison of 
next generation sequencing enrichment methods using an aortopathy 
gene panel- clinical diagnostics perspective. BMC Medical Genomics 2012 
5:50. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at f~\ RiftMM i rpntral 

www.biomedcentral.com/submit momea central 



